THE MATRIX-FOREST THEOREM AND MEASURING RELATIONS IN
SMALL SOCIAL GROUPS



P. Yu. Chebotarev and E. V. Shamis UDC 519.172 



We propose a family of graph structural indices related to the matrix-forest theorem. The properties 
of the basic index that expresses the mutual connectivity of two vertices are studied in detail. The 
derivative indices that measure "dissociation," "solitariness," and "provinciality" of vertices are also 
considered. A nonstandard metric on the set of vertices is introduced, which is determined by their 
connectivity. The application of these indices in sociometry is discussed. 

1. INTRODUCTION 

Given a graph, how should one evaluate the proximity between its vertices? The standard distance function is 
the length of the shortest path. But is it not worth taking into account the number of paths between vertices? Which 
vertices can be considered central and which peripheral? Which graphs are dense, and which are sparse? Which 
are homogeneous? The choice of indices that express these and other structural properties of graphs depends on 
applications, more exactly, on the type of applications. This type should ideally be formulated in terms of axiomatic 
requirements on the structural indices or via modeling those concepts that should be evaluated by these indices. The 
applications are numerous; essentially, these are all applications of graph theory: transport, reliability, transmission 
of information, structural modeling, chemistry, molecular biology, epidemiology, etc. 

The application we shall focus on is one of the most difficult to formalize. It is sociology, more precisely, 
sociometry where structural indices are usually chosen heuristically. 

Sociometry studies the structure of small social groups on the basis of given relations on them. As a rule, these 
relations are binary; in some cases they are weighted. Small social groups are groups where public relations manifest 
themselves in the form of personal contacts or, simply stated, these groups are natural communities where everyone 
knows each other. The binary relations under study mainly result from sociometric interrogations. For example, 
each member of a group is asked to indicate those persons with whom she is in sympathy (or out of sympathy), or 
with whom she spends her spare time most often, or with whom she would prefer to cooperate in certain activities 
(work, rest, "exploration," etc.), or who, in her opinion, has certain characteristics. If a member i of a group indicates
(among others) j, an arc from i to j is drawn in the digraph of the relationship. Nonoriented graphs are frequently
included to represent objective information (contacts, collaborations, etc.). If a set of similar questions is asked or the 
respondents report their assumptions on the opinion of others (autosociometric data), multigraphs or multidigraphs 
can serve as the model. A similar approach is used in political studies where countries or parties involved in certain 
relationships are investigated. 

Many different kinds of relations can be studied, each requiring its own properties of the structural indices, so
it is problematic to construct the desirable axiomatics for every application. Another approach seems more realistic:
to collect a "library" of structural indices with specified properties, and to use those indices whose features are most 
appropriate for the relations under study. 

Traditional sociological indices are very simple. For instance, the sociometric status of the ith member of a 
group is the normalized in-degree (the number of entering arcs) of the ith vertex; the psychological effusiveness is the 
normalized out-degree; the reciprocity of the choice of i is the normalized number of pairs of opposite arcs incident
to i [1]. The density and the cohesion of a group are obtained by averaging the sociometric status and the reciprocity over
the group. The heterogeneity of a group is measured by the empirical variance of the sociometric status over the 
group. The imperfection of these elementary indices is caused by their local nature. In this connection, Paniotto [1] 
adduces examples of essentially dissimilar structures with the same values of the above indices and states that more 
sensitive indices that capture the topological structure as a whole and the part of each individual in this structure 
are desirable. 
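The local nature of these indices can be illustrated with a short Python sketch. It is only an illustration and does not reproduce Paniotto's examples: the function name `sociometric_indices`, the normalization by n - 1, and the two test structures (a hexagon and a pair of disjoint triangles, with every tie reciprocated) are assumptions made here.

```python
from fractions import Fraction

def sociometric_indices(n, arcs):
    """Elementary indices of a digraph on vertices 0..n-1.

    arcs is an iterable of (i, j) pairs meaning "i chose j".
    Normalizing by n - 1 (the largest possible degree) is an assumption.
    """
    arcs = set(arcs)
    norm = n - 1
    status = [Fraction(sum((j, i) in arcs for j in range(n)), norm) for i in range(n)]
    effusiveness = [Fraction(sum((i, j) in arcs for j in range(n)), norm) for i in range(n)]
    reciprocity = [Fraction(sum((i, j) in arcs and (j, i) in arcs for j in range(n)), norm)
                   for i in range(n)]
    density = sum(status) / n                      # mean sociometric status
    cohesion = sum(reciprocity) / n                # mean reciprocity
    heterogeneity = sum((s - density) ** 2 for s in status) / n   # empirical variance
    return status, effusiveness, reciprocity, density, cohesion, heterogeneity

def symmetric(pairs):
    """Turn undirected pairs into arcs in both directions."""
    return [(i, j) for i, j in pairs] + [(j, i) for i, j in pairs]

hexagon = symmetric([(i, (i + 1) % 6) for i in range(6)])
two_triangles = symmetric([(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)])

# One structure is connected, the other is not, yet all six indices coincide.
print(sociometric_indices(6, hexagon) == sociometric_indices(6, two_triangles))
```

Every vertex has in-degree and out-degree 2 in both structures, so degree-based indices cannot distinguish them.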

A family of more sensitive indices is based on evaluating the vertex status by the sum of the lengths (or
reciprocal lengths) of the shortest paths that lead to this vertex from all other vertices [2]. Note, however, that such
characteristics of a group member are frequently unchanged on altering the connections between other members, 
which may not conform with the interpretation of the model. Furthermore, when characterizing the proximity of two 
members of a group, it is often worth taking into account not only the length of the shortest path between them, 
but also the numbers of paths of various lengths. 

One more idea employed in the construction of sociometric indices is to evaluate the group cohesion by the 
number of arcs (edges) in the minimum cutset, i.e., by the minimum number of connections whose removal breaks the 
connectedness of the corresponding graph. The normalized minimum number of members whose removal (together 
with their connections) makes the graph disconnected is sometimes called group stability (vitality). One problem 
(besides the computational one) with such indices is that the graph can be found to be disconnected from the very 
beginning. This situation still allows one to study the increase in the number of connected components. On the other 
hand, even for a connected graph, these indices are solely determined by its "bottlenecks," i.e., such characteristics 
are indifferent to the existence of joined subgroups with relatively poor connections between them. 

In this paper, we propose a family of sensitive structural indices and study its properties. The definitions of
the indices are based on the matrix-forest theorem (Section 2). Section 3 is devoted to the properties of the basic
index of vertex proximity; Section 4 discusses it and introduces derivative indices.

The basic results of this work were stated in [3]. A topological interpretation of the vertex proximity index 
(implicitly used in [4]) was obtained in [5, 6]; an interesting further investigation of the matrix of these indices was 
undertaken in [7]; close ideas with reference to chemistry were developed in [8], where an important analogy with 
electrical networks was also formulated. In a subsequent paper, we are going to compare the structural indices 
proposed here with other ones known from the literature (see, for example, [9]). 

2. THE MATRIX-FOREST THEOREM 

The matrix-forest theorem is formulated for multigraphs and multidigraphs (which differ from graphs and 
digraphs by the possibility of multiple edges and arcs). A subgraph of a multigraph G is a multigraph all of whose 
vertices and edges belong to the vertex set and the edge set of G. A spanning subgraph of a multigraph G is a 
subgraph of G with the same vertex set as that of G. A path in a multigraph G is an alternating sequence of distinct 
vertices and edges, which starts and ends with vertices and has each edge situated between two vertices incident to 
it. Sometimes we consider a path as a subgraph of G. A forest is a cycleless graph. A tree is a connected forest. 
A rooted tree is a tree with one marked vertex, called a root. Formally, a rooted tree is a pair (T, r), where T is a 
tree and r is its vertex. A component of a multigraph G is any maximal (by inclusion) connected subgraph of G. 
Obviously, all components of a forest are trees. 

A rooted forest is defined as a forest with one marked vertex in each component. A directed path in a
multidigraph $\Gamma$ is defined similarly to a path in a multigraph, but here each arc is directed from the previous vertex
to the next one in the sequence. A digraph is called a directed tree (a directed forest) if the graph obtained from
it by replacement of all its arcs with edges is a tree (a forest). The definitions of directed rooted tree and directed
rooted forest are analogous to the definitions of rooted tree and rooted forest (we will omit the word "directed" while
talking about subgraphs of $\Gamma$). A diverging tree is a directed rooted tree that contains directed paths from the root
to all other vertices. A diverging forest is a directed rooted forest, all of whose components are diverging trees.

Suppose that G is a weighted multigraph with vertex set $V(G) = \{1, \ldots, n\}$ and edge set $E(G)$. Let $\varepsilon_{ij}^p \ge 0$
be the weight of the pth edge between vertices i and j in G. This weight will be also referred to as the conductance
of the edge.

The Kirchhoff matrix of G is the $n \times n$ matrix $L = L(G) = (\ell_{ij})$ with

$$\ell_{ij} = -\sum_{p=1}^{a_{ij}} \varepsilon_{ij}^p, \quad j \ne i, \quad i, j = 1, \ldots, n, \tag{1}$$

$$\ell_{ii} = -\sum_{j \ne i} \ell_{ij}, \quad i = 1, \ldots, n, \tag{2}$$

where $a_{ij}$ is the number of edges between i and j. The product of the weights of all edges that belong to a subgraph
H of a multigraph G will be referred to as the weight or conductance of H and denoted by $\varepsilon(H)$. The weight
(conductance) of a subgraph without edges is set to be 1. For every nonempty set of subgraphs $\mathcal{G}$, its weight is
defined as follows:

$$\varepsilon(\mathcal{G}) = \sum_{H \in \mathcal{G}} \varepsilon(H).$$

The weight of the empty set is zero.
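The definitions above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: vertices are numbered from 0, a multigraph is given as a list of `(i, j, weight)` triples in which a repeated pair denotes parallel edges, and the helper names `kirchhoff` and `weight` are hypothetical.

```python
from fractions import Fraction

def kirchhoff(n, edges):
    """Kirchhoff matrix L of a weighted multigraph, following Eqs. (1) and (2).

    edges is a list of (i, j, weight) triples with 0-based vertices i != j;
    repeated pairs are treated as parallel edges.
    """
    L = [[Fraction(0)] * n for _ in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        L[i][j] -= w          # Eq. (1): minus the total weight of the edges between i and j
        L[j][i] -= w
    for i in range(n):
        # Eq. (2): the diagonal entry makes every row sum to zero
        L[i][i] = -sum(L[i][j] for j in range(n) if j != i)
    return L

def weight(subgraph_edges):
    """Weight (conductance) of a subgraph: product of its edge weights, 1 if edgeless."""
    w = Fraction(1)
    for _, _, e in subgraph_edges:
        w *= Fraction(e)
    return w

edges = [(0, 1, 1), (0, 1, 2), (1, 2, 3)]   # two parallel edges between vertices 0 and 1
L = kirchhoff(3, edges)
for row in L:
    print([str(x) for x in row])
```

Exact rational arithmetic is used so that later identities can be checked without rounding error.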

The following matrix-forest lemmas are similar to the classical matrix-tree theorems, obtained by Kirchhoff
and some other writers in the nineteenth century (for the history, see [10]). We shall formulate Tutte's generalization
of the matrix-tree theorem to weighted multigraphs (see [11]).

Denote by $L^{ij}$ the cofactor of $\ell_{ij}$ in L. Let $\mathcal{T}(G) = \mathcal{T}$ be the set of all spanning trees of multigraph G.

THEOREM 1 (matrix-tree theorem for weighted multigraphs). For any weighted multigraph G
and for any $i, j \in V(G)$, $L^{ij} = \varepsilon(\mathcal{T})$.

Tutte also obtained an analogous result for weighted multidigraphs. 

Let $\Gamma$ be a multidigraph with vertex set $V(\Gamma) = \{1, \ldots, n\}$, and suppose that $\varepsilon_{ij}^p$ is the weight (or the
conductance) of the pth arc from i to j in $\Gamma$. The Kirchhoff matrix of $\Gamma$ is the $n \times n$ matrix $L = L(\Gamma) = (\ell_{ij})$ with
entries $\ell_{ij} = -\sum_{p=1}^{a_{ji}} \varepsilon_{ji}^p$, $j \ne i$, $i, j = 1, \ldots, n$, and $\ell_{ii} = -\sum_{j \ne i} \ell_{ij}$, $i = 1, \ldots, n$, where $a_{ji}$ is the number of arcs from
j to i in $\Gamma$. Observe that $\ell_{ii}$ is the total conductance of the arcs converging to i. The conductance (weight) of a
subgraph of $\Gamma$ and the weight of a set of multidigraphs are defined analogously to the case of multigraphs.

Suppose that $\mathcal{T}^i$ is the set of all spanning trees of $\Gamma$ diverging from i, and $L^{ij}$ is the cofactor of $\ell_{ij}$ in L, as
before.

THEOREM 2 (matrix-tree theorem for weighted multidigraphs). For any weighted multidigraph
$\Gamma$ and for any $i, j \in V(\Gamma)$, $L^{ij} = \varepsilon(\mathcal{T}^i)$.

Observe that in the directed case, entries in different rows of L may have different cofactors, but all the entries
of the same row have equal cofactors. For simplicity, Tutte formulates these theorems only for diagonal cofactors
$L^{ii}$. The "directed" matrix-tree theorem concerning all $L^{ij}$ is given in [12]. If the weights of all edges (arcs) are ones,
Theorems 1 and 2 tell us about the numbers of the corresponding spanning trees.
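For unit weights, Theorem 1 can be checked by brute force on a small graph. The following Python sketch compares every cofactor of the Kirchhoff matrix of the complete graph on four vertices with a direct count of its spanning trees. The helper names `det`, `cofactor`, and `is_forest` and the choice of test graph are assumptions of this illustration.

```python
from fractions import Fraction
from itertools import combinations

def det(A):
    """Exact determinant by Gaussian elimination over the rationals."""
    A = [[Fraction(x) for x in row] for row in A]
    n, d = len(A), Fraction(1)
    for c in range(n):
        p = next((r for r in range(c, n) if A[r][c] != 0), None)
        if p is None:
            return Fraction(0)
        if p != c:
            A[c], A[p] = A[p], A[c]
            d = -d                                  # a row swap flips the sign
        d *= A[c][c]
        for r in range(c + 1, n):
            f = A[r][c] / A[c][c]
            A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return d

def cofactor(A, i, j):
    minor = [[A[r][c] for c in range(len(A)) if c != j] for r in range(len(A)) if r != i]
    return (-1) ** (i + j) * det(minor)

def is_forest(n, edges):
    """True if the given edges contain no cycle (union-find check)."""
    parent = list(range(n))
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    for i, j in edges:
        ri, rj = find(i), find(j)
        if ri == rj:
            return False
        parent[ri] = rj
    return True

n = 4
edges = list(combinations(range(n), 2))             # complete graph on 4 vertices, unit weights
L = [[0] * n for _ in range(n)]
for i, j in edges:
    L[i][j] -= 1
    L[j][i] -= 1
    L[i][i] += 1
    L[j][j] += 1

# A cycle-free set of n - 1 edges on n vertices is a spanning tree.
trees = sum(is_forest(n, s) for s in combinations(edges, n - 1))
cofactors = {cofactor(L, i, j) for i in range(n) for j in range(n)}
print(trees, cofactors)
```

All sixteen cofactors coincide and equal the number of spanning trees, as the theorem states for the undirected case.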

We shall now formulate the matrix-forest lemmas and the matrix-forest theorem. 

Consider the matrices

$$W(G) = I + L(G)$$

and

$$W(\Gamma) = I + L(\Gamma),$$

where I is the identity matrix. $W^{ij}(G)$ and $W^{ij}(\Gamma)$ will denote the cofactors of the (i, j)-entries of $W(G)$ and $W(\Gamma)$,
respectively.

Suppose that $\mathcal{F}(G) = \mathcal{F}$ is the set of all spanning rooted forests of a weighted multigraph G and $\mathcal{F}^{ij}(G) = \mathcal{F}^{ij}$
is the set of those spanning rooted forests of G such that i and j belong to the same tree rooted at i. Let
$W = W(G)$, $W^{ij} = W^{ij}(G)$.

LEMMA 1 (matrix-forest lemma for weighted multigraphs). For any weighted multigraph G,

(1) $\det W = \varepsilon(\mathcal{F})$;

(2) for any $i, j \in V(G)$, $W^{ij} = \varepsilon(\mathcal{F}^{ij})$.

Since the matrix W of a weighted multigraph is symmetric, item (2) of Lemma 1 remains true if we replace
$\mathcal{F}^{ij}$ by $\mathcal{F}^{ji}$.

Suppose that $\mathcal{F}(\Gamma) = \mathcal{F}$ is the set of all spanning diverging forests of multidigraph $\Gamma$ and $\mathcal{F}^{i \to j}(\Gamma) = \mathcal{F}^{i \to j}$
is the set of those spanning diverging forests of $\Gamma$ such that i and j belong to the same tree diverging from i. Let
$W = W(\Gamma)$, $W^{ij} = W^{ij}(\Gamma)$.

LEMMA 2 (matrix-forest lemma for weighted multidigraphs). For any weighted multidigraph $\Gamma$,

(1) $\det W = \varepsilon(\mathcal{F})$;

(2) for any $i, j \in V(\Gamma)$, $W^{ij} = \varepsilon(\mathcal{F}^{i \to j})$.

A similar lemma can be formulated for converging forests.

If the matrix $W^{-1}$ exists, we will denote it by

$$Q = (q_{ij}) = W^{-1} = (I + L)^{-1} \tag{3}$$

(either for a weighted multigraph G or for a weighted multidigraph $\Gamma$). Then $Q = (\det W)^{-1} W^*$, where $W^* = (W^{ji})$
is the adjugate matrix of W. The matrix-forest theorem [5, 3] follows from Lemmas 1 and 2.

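THEOREM 3 (matrix-forest theorem).

1. For any weighted multigraph G, the matrix $Q = (q_{ij})$ exists and $q_{ij} = \varepsilon(\mathcal{F}^{ij}) / \varepsilon(\mathcal{F})$, $i, j = 1, \ldots, n$.

2. For any weighted multidigraph $\Gamma$, the matrix $Q = (q_{ij})$ exists and $q_{ij} = \varepsilon(\mathcal{F}^{j \to i}) / \varepsilon(\mathcal{F})$, $i, j = 1, \ldots, n$.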

If the weights of all edges (arcs) are ones, the weights of sets of spanning forests in Lemmas 1 and 2 and 
Theorem 3 are equal to the numbers of the corresponding forests. 
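The first item of Theorem 3 can be checked numerically on a small weighted multigraph. The Python sketch below computes Q = (I + L)^{-1} exactly and compares it with a brute-force enumeration of spanning rooted forests. The function names `forest_matrix` and `rooted_forest_weights`, the 0-based vertex numbering, and the test multigraph are assumptions of this illustration; the enumeration is exponential and is meant only for tiny examples.

```python
from fractions import Fraction
from itertools import combinations, product

def forest_matrix(n, edges):
    """Q = (I + L)^{-1} for (i, j, weight) edges on vertices 0..n-1, computed exactly."""
    W = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        W[i][i] += w
        W[j][j] += w
        W[i][j] -= w
        W[j][i] -= w
    # Gauss-Jordan elimination on the augmented matrix [W | I].
    A = [row + [Fraction(int(r == c)) for c in range(n)] for r, row in enumerate(W)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)   # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [row[n:] for row in A]

def rooted_forest_weights(n, edges):
    """Total weight of all spanning rooted forests, and the matrix whose (i, j)
    entry is the weight of those forests in which j lies in the tree rooted at i."""
    total = Fraction(0)
    f = [[Fraction(0)] * n for _ in range(n)]
    for k in range(n):                               # a forest has at most n - 1 edges
        for sub in combinations(edges, k):
            comp = list(range(n))                    # component label of every vertex
            acyclic = True
            for i, j, _ in sub:
                if comp[i] == comp[j]:               # this edge would close a cycle
                    acyclic = False
                    break
                old, new = comp[i], comp[j]
                comp = [new if c == old else c for c in comp]
            if not acyclic:
                continue
            w = Fraction(1)
            for _, _, e in sub:
                w *= Fraction(e)
            groups = {}
            for v, c in enumerate(comp):
                groups.setdefault(c, []).append(v)
            for roots in product(*groups.values()):  # choose one root per component
                total += w
                for r in roots:
                    for v in groups[comp[r]]:
                        f[r][v] += w
    return total, f

n = 4
edges = [(0, 1, 2), (0, 1, 1), (1, 2, 1), (0, 2, Fraction(1, 2)), (2, 3, 3)]
Q = forest_matrix(n, edges)
total, f = rooted_forest_weights(n, edges)
print(all(Q[i][j] == f[i][j] / total for i in range(n) for j in range(n)))
```

For the unit-weight path on three vertices the same enumeration finds 8 spanning rooted forests, 5 of which have the end vertex 0 as a root, which agrees with the corresponding diagonal entry 5/8 of Q.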

Lemma 2 can be derived in the shortest way from one version of Chaiken's result [13], namely, by putting
$U = W = \emptyset$ and then $U = \{i\}$, $W = \{j\}$ in the first formula on page 328 (cf. [14, Theorem 3.1]). A longer inference
results by the sequential application of results from [15-18]. This also provides an interpretation for the inverse
Laplacian characteristic matrix of a multidigraph. An inference of Lemma 1 from Lemma 2 is given in the Appendix,
as well as the proofs of the following results. Another complete (i.e., not exploiting any strong theorems) proof of
Lemma 1 for the case of equal weights of edges is contained in [6].

The matrix-forest theorem allows us to consider the matrix $Q = (q_{ij})$ as the matrix of "relative forest
accessibilities" (in short, accessibilities) of the vertices of G (or $\Gamma$). These values can be used to measure the proximity
between vertices (the "farther" i from j, the smaller is $q_{ij}$). This interpretation is validated by the properties presented
in the following section. For simplicity, these properties are formulated for nonoriented multigraphs, although many
of them have "oriented" counterparts which can be proved similarly.



3. PROPERTIES OF THE RELATIVE FOREST ACCESSIBILITIES 

Suppose that G is a weighted multigraph with strictly positive weights of edges, and let
$Q = (q_{ij}) = (I + L(G))^{-1}$
be its matrix of relative forest accessibilities.

PROPOSITION 1. For any G, matrix Q is symmetric. 

PROPOSITION 2. For any G, Q is a doubly stochastic matrix, i.e.,

(1) $q_{ij} \ge 0$, $i, j = 1, \ldots, n$;

(2) $\sum_{j=1}^{n} q_{ij} = 1$, $i = 1, \ldots, n$;

(3) $\sum_{i=1}^{n} q_{ij} = 1$, $j = 1, \ldots, n$.

According to this property, $q_{ij}$ may be interpreted as the fraction of the connectivity of vertices i and j in
the total connectivity of i with all vertices.

PROPOSITION 3. For any G and for any $i, j = 1, \ldots, n$ such that $j \ne i$, $q_{ii} > q_{ij}$.

This property has a natural interpretation, namely, each vertex is more "accessible" from itself than from 
any other vertex. 

PROPOSITION 4 (triangle inequality for proximities). For any G and for any $i, j, k = 1, \ldots, n$,
$q_{ij} + q_{ik} - q_{jk} \le q_{ii}$. If, in addition, $i \ne j$ and $i \ne k$, then $q_{ij} + q_{ik} - q_{jk} < q_{ii}$.

Consider the index

$$d_{ij} = q_{ii} + q_{jj} - q_{ij} - q_{ji} = q_{ii} + q_{jj} - 2q_{ij}, \quad i, j = 1, \ldots, n. \tag{4}$$

ASSERTION 1. $d(i, j) = d_{ij}$, $i, j = 1, \ldots, n$, is a distance function for multigraph vertices, i.e., it complies
with the axioms of metric.

This assertion is easily proved using the above propositions (this is left to the reader). The triangle inequality
for proximities turns out to be equivalent to the ordinary triangle inequality for metric $d_{ij}$, which justifies the name
of the former inequality. In contrast to the standard graph distance, this metric considers all connections in a graph.
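Propositions 1 through 4 and the distance of Eq. (4) are easy to inspect on a small example. The Python sketch below does so for the path 0 - 1 - 2 with unit weights; the helper names `forest_matrix` and `forest_distance` and the 0-based numbering are assumptions of this illustration.

```python
from fractions import Fraction

def forest_matrix(n, edges):
    """Q = (I + L)^{-1} for (i, j, weight) edges on vertices 0..n-1, computed exactly."""
    W = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        W[i][i] += w
        W[j][j] += w
        W[i][j] -= w
        W[j][i] -= w
    # Gauss-Jordan elimination on the augmented matrix [W | I].
    A = [row + [Fraction(int(r == c)) for c in range(n)] for r, row in enumerate(W)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)   # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [row[n:] for row in A]

def forest_distance(Q):
    """Distance of Eq. (4): d_ij = q_ii + q_jj - 2 q_ij (Q is symmetric)."""
    n = len(Q)
    return [[Q[i][i] + Q[j][j] - 2 * Q[i][j] for j in range(n)] for i in range(n)]

n = 3
Q = forest_matrix(n, [(0, 1, 1), (1, 2, 1)])     # path 0 - 1 - 2 with unit weights
d = forest_distance(Q)

symmetric = all(Q[i][j] == Q[j][i] for i in range(n) for j in range(n))
doubly_stochastic = all(sum(row) == 1 for row in Q) and all(x >= 0 for row in Q for x in row)
diagonal_largest = all(Q[i][i] > Q[i][j] for i in range(n) for j in range(n) if i != j)
triangle = all(d[i][j] <= d[i][k] + d[k][j]
               for i in range(n) for j in range(n) for k in range(n))
print(symmetric, doubly_stochastic, diagonal_largest, triangle)
for row in d:
    print([str(x) for x in row])
```

Here the end vertices are at distance 1 from each other and at distance 5/8 from the middle vertex, whereas the shortest-path distances are 2 and 1.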

PROPOSITION 5. For any G and for any $i, j = 1, \ldots, n$, $q_{ij} = 0$ iff there exist no paths between i and j.

COROLLARY. (1) Matrix Q is reducible to a block-diagonal form, where all block entries are strictly
positive and all off-block entries are zeros. Q is strictly positive iff multigraph G is connected.
(2) For any $i, j, k \in V(G)$, if $q_{ik} > 0$ and $q_{jk} > 0$, then $q_{ij} > 0$.

PROPOSITION 6. For any G and for any $i, k, t = 1, \ldots, n$,

(1) if there exists a path in G from i to k, $t \ne k$, and every path from i to t includes k, then $q_{ik} > q_{it}$;

(2) if $q_{ik} > q_{it}$ and $i \ne k$, then there exists a path from i to k such that the difference $(q_{jk} - q_{jt})$ strictly increases
as j progresses from i to k along the path.

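PROPOSITION 7. Suppose that some edge weight $\varepsilon_{kt}^p$ in G increases by $\Delta\varepsilon_{kt} > 0$ or an extra edge
between k and t with a strictly positive weight $\Delta\varepsilon_{kt}$ is added to G. Let G' be the new graph and $W' = W(G')$,
$Q' = Q(G')$. Then

(1) $\Delta Q = hR$, where $\Delta Q = Q' - Q$, $h = \dfrac{\Delta\varepsilon_{kt}}{1 + \Delta\varepsilon_{kt} d_{kt}} = (d_{kt} + 1/\Delta\varepsilon_{kt})^{-1}$, and $R = (r_{ij})$ is the
$n \times n$ matrix with entries $r_{ij} = (q_{ik} - q_{it})(q_{jt} - q_{jk})$;

(2) (this item and the following three are corollaries from item (1)) all rows and all columns of $\Delta Q$ are
proportional, i.e., $\operatorname{rank} \Delta Q = 1$;

(3) if $q_{ik} > q_{it}$, then $\Delta q_{ij} > 0$ iff $q_{jt} > q_{jk}$, and $\Delta q_{ij} < 0$ iff $q_{jk} > q_{jt}$;

(4) the signs of all increments $\Delta q_{ij}$ do not depend on the absolute value of $\Delta\varepsilon_{kt}$, and the absolute values of
nonzero $\Delta q_{ij}$ strictly increase in $\Delta\varepsilon_{kt}$;

(5) for any $i, j \in V(G)$, $\Delta d_{ij} = -\frac{1}{4}(d_{ik} + d_{jt} - d_{it} - d_{jk})^2 (d_{kt} + 1/\Delta\varepsilon_{kt})^{-1}$, and therefore $d'_{ij} \le d_{ij}$.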

According to item (3), if the direct connection between k and t intensifies, then the relative accessibility of
j from i increases if and only if i and j initially were "more strongly connected" with different vertices of the pair
(k, t). Otherwise, it can be said that the connections in the multigraph intensify outside of most paths from i to j,
thus the relative accessibility of j from i decreases.
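Item (1) of Proposition 7 lends itself to a direct numerical check. The Python sketch below adds an edge of weight 1/2 between the end vertices of the unit-weight path 0 - 1 - 2 - 3 and compares the actual change of Q with the rank-one prediction h R, where h = (d_kt + 1/Δε_kt)^{-1} and r_ij = (q_ik - q_it)(q_jt - q_jk). The function names, the 0-based numbering, and the test graph are assumptions of this illustration.

```python
from fractions import Fraction

def forest_matrix(n, edges):
    """Q = (I + L)^{-1} for (i, j, weight) edges on vertices 0..n-1, computed exactly."""
    W = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        W[i][i] += w
        W[j][j] += w
        W[i][j] -= w
        W[j][i] -= w
    # Gauss-Jordan elimination on the augmented matrix [W | I].
    A = [row + [Fraction(int(r == c)) for c in range(n)] for r, row in enumerate(W)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)   # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [row[n:] for row in A]

def rank_one_update(Q, k, t, delta):
    """Predicted change of Q when the weight between k and t grows by delta."""
    n = len(Q)
    d_kt = Q[k][k] + Q[t][t] - 2 * Q[k][t]
    h = 1 / (d_kt + 1 / Fraction(delta))
    return [[h * (Q[i][k] - Q[i][t]) * (Q[j][t] - Q[j][k]) for j in range(n)]
            for i in range(n)]

n = 4
edges = [(0, 1, 1), (1, 2, 1), (2, 3, 1)]            # path 0 - 1 - 2 - 3
k, t, delta = 0, 3, Fraction(1, 2)

Q = forest_matrix(n, edges)
Q_new = forest_matrix(n, edges + [(k, t, delta)])    # extra edge between k and t
predicted = rank_one_update(Q, k, t, delta)

print(all(Q_new[i][j] - Q[i][j] == predicted[i][j] for i in range(n) for j in range(n)))
```

Because the update has rank one, recomputing the inverse after a single edge change is unnecessary; the previous Q is enough.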

Propositions 8 and 9 are corollaries of Proposition 7. 

PROPOSITION 8. Suppose that some edge weight $\varepsilon_{kt}^p$ in G increases or an extra edge between k and t
with a positive weight is added to G. Then

(1) $q_{kt}$ increases;

(2) for any $i = 1, \ldots, n$, if there exists a path from i to k and every path from i to t includes k, then
$\Delta q_{it} > \Delta q_{ik}$;

(3) for any $i_1, i_2 = 1, \ldots, n$, if both $i_1$ and $i_2$ can be substituted for i in the hypothesis of item (2), then $q_{i_1 i_2}$
decreases;

(4) for any $i = 1, \ldots, n$, if $q_{ik} = q_{it}$, then $q_{ij}$ do not alter for all $j = 1, \ldots, n$.

By item (3), the relative accessibility between a pair of vertices decreases when some "extraneous" connections 
appear or intensify in G. 

Let D be a subset of vertex set $V(G)$. We say that D is a macrovertex in G if for all $i \in D$, $j \in D$, and
$k \notin D$, $\varepsilon_{ik} = \varepsilon_{jk}$, where $\varepsilon_{ij} = \sum_{p=1}^{a_{ij}} \varepsilon_{ij}^p$ is the total weight of the edges between i and j.

The following property is among the most interesting ones. It provides a sufficient condition for the equality
and stability of relative forest accessibilities.

PROPOSITION 9 (macrovertex independence). Suppose that D is a macrovertex in G and $i \in D$,
$j \in D$, $k \notin D$. Then

(1) $q_{ik} = q_{jk}$;

(2) $q_{ik}$ does not alter when any new edges appear or the weights of any existing edges change inside D.
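Macrovertex independence can be observed on a small example. In the Python sketch below, the set D = {0, 1, 2} is assumed to be a macrovertex in the sense that each of its vertices is joined to every outside vertex by the same total weight (here weight 1 to vertex 3 and no edge to vertex 4); this reading of the definition, the function name `forest_matrix`, and the test graph are assumptions of this illustration.

```python
from fractions import Fraction

def forest_matrix(n, edges):
    """Q = (I + L)^{-1} for (i, j, weight) edges on vertices 0..n-1, computed exactly."""
    W = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        W[i][i] += w
        W[j][j] += w
        W[i][j] -= w
        W[j][i] -= w
    # Gauss-Jordan elimination on the augmented matrix [W | I].
    A = [row + [Fraction(int(r == c)) for c in range(n)] for r, row in enumerate(W)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)   # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [row[n:] for row in A]

# D = {0, 1, 2}; vertices 3 and 4 lie outside D.
outside = [(0, 3, 1), (1, 3, 1), (2, 3, 1), (3, 4, 2)]
# Three different sets of edges inside D.
variants = [[], [(0, 1, 1)], [(0, 1, 5), (1, 2, Fraction(1, 3))]]
matrices = [forest_matrix(5, outside + inside) for inside in variants]

for Q in matrices:
    print([str(Q[i][3]) for i in range(3)], [str(Q[i][4]) for i in range(3)])
```

The accessibilities of the outside vertices 3 and 4 from the members of D coincide within each variant and do not change from one variant to the next, although the edges inside D differ.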

Now we shall obtain an alternative topological interpretation of the matrix Q of relative forest accessibilities
(the first interpretation is provided by Theorem 3). It will be demonstrated that $q_{ij}$ are related to the weights of
routes of various lengths between i and j in G. To be more precise, introduce the notion of route with drains.

A route with drains (RWD) is an alternating sequence of multigraph vertices and edges with the following
features:

(1) the sequence starts and ends with vertices; 

(2) the edge located between two different vertices in the sequence is incident to them. If the same vertex 
stands in the sequence before and after an edge, it is only required that it be incident to this edge, the second incident 
vertex being arbitrary. Such an edge is called a drain. 

Routes with drains result from the usual routes by adding any number of one-edge offshoots (drains), which 
may, in particular, follow "forward" and "backward" along the original route. 

The total number of edges in the sequence is called the length of the route with drains. By definition,
for any vertex i there exists exactly one route with drains of length 0 from i to i, and there are no other routes of length 0.

The weight of a route with drains is defined as the product of the weights of all its edges (if an edge enters
a route with drains k times, its weight is taken with exponent k). For any $i = 1, \ldots, n$, the weight of the 0-length
RWD from i to i is set to be 1.

Let $a^* = \max_{i,j \in V(G)} a_{ij}$ be the maximal number of multiple edges incident to any pair of vertices in G.

PROPOSITION 10. For any weighted multigraph G with all weights of edges from the interval
$(0, (2a^*(n-1))^{-1})$ and for any $i, j = 1, \ldots, n$,

$$q_{ij} = \sum_{t=0}^{\infty} \left( U_{ij}^{(t)} - P_{ij}^{(t)} \right),$$

where $U_{ij}^{(t)}$ and $P_{ij}^{(t)}$ are the total weights of all routes of length t with even and odd number of drains between
vertices i and j in G, respectively.
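The series in Proposition 10 is the entrywise form of the expansion of (I + L)^{-1} in powers of M = -L, so its convergence can be observed numerically. The Python sketch below sums the first 61 powers of M for the path 0 - 1 - 2 with edge weights 1/5, which lie inside the interval required by the proposition (here a* = 1 and n = 3, so the bound is 1/4). The function names and the test graph are assumptions of this illustration; the routes with drains themselves are not enumerated.

```python
from fractions import Fraction

def forest_matrix(n, edges):
    """Q = (I + L)^{-1} for (i, j, weight) edges on vertices 0..n-1, computed exactly."""
    W = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        W[i][i] += w
        W[j][j] += w
        W[i][j] -= w
        W[j][i] -= w
    # Gauss-Jordan elimination on the augmented matrix [W | I].
    A = [row + [Fraction(int(r == c)) for c in range(n)] for r, row in enumerate(W)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)   # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [row[n:] for row in A]

def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def series_partial_sum(n, edges, terms):
    """I + M + M^2 + ... + M^terms with M = -L."""
    M = [[Fraction(0)] * n for _ in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        M[i][j] += w          # off-diagonal entries of M are the edge weights
        M[j][i] += w
        M[i][i] -= w          # diagonal entries of M are minus the weighted degrees
        M[j][j] -= w
    identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    total, power = identity, identity
    for _ in range(terms):
        power = matmul(power, M)
        total = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(total, power)]
    return total

n = 3
edges = [(0, 1, Fraction(1, 5)), (1, 2, Fraction(1, 5))]
Q = forest_matrix(n, edges)
S = series_partial_sum(n, edges, 60)
error = max(abs(S[i][j] - Q[i][j]) for i in range(n) for j in range(n))
print(float(error))
```

The condition on the weights is sufficient for convergence; with larger weights the partial sums need not settle, which is one reason to treat Q itself, rather than the series, as the primary object.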

One more interpretation of Q can be obtained with the help of the Cayley-Hamilton theorem [8]. 

Instead of Q, one can use the matrices $Q_\alpha = (I + \alpha L)^{-1}$, $\alpha > 0$, which have the same properties as Q except
for Proposition 10, where the factor $\alpha$ appears. The parameter $\alpha > 0$ specifies the relative weight given to long
connections between vertices of G versus short ones.

4. ACCESSIBILITY AND DERIVATIVE STRUCTURAL INDICES 

The foregoing properties of the relative forest accessibility demonstrate that it is an appropriate index of
proximity (connectivity, accessibility) of graph (multigraph) vertices. A distinctive feature of this index is its
normalization: the sum of the accessibilities of all vertices from a given one and the sum of the accessibilities of a given
vertex from all vertices of a multigraph are equal to unity. Therefore, each ith row of the matrix Q can be treated
as a probability distribution (or shares of a certain resource) somehow related to the vertex i. In which cases is such
a normalization necessary? Consider two examples.

Suppose that the members of a group collect information from the environment and exchange it with each 
other, the intensity of the exchange being specified for each pair. Every participant transmits not only the information 
collected on her own, but also that received from the others. It is required to ascertain which fractions of the 
cumulative information received by the ith participant were initially collected by each member of the group. In this 
example, information can be replaced with, for example, influence or material resources. The principal feature is the 
distribution of some resource related to a certain vertex, over all vertices. 

The second example is a variant of a children's "ring" game in which the ring may successively be passed 
many times, and this is done secretly, not before the players' eyes. If the pairwise transfer probabilities are specified 
for all players along with the temporal parameters of this random process (which is a Markov process in the simplest 
case), one can take an interest in the ring's location probabilities at every moment, provided that its starting location 
was at vertex (player) i. The main feature of this example is the presence of probability distributions related to each 
vertex. 

In the above examples, if an adequate mathematical model is stated, the result is precise, not heuristic, and 
there is no need to select it being guided by "good properties," such as those given in the previous section. It turns 
out, however, that for both examples there are rather natural models (we intend to describe them and compare 
them with other models, e.g., [9, 19], in our next paper), which lead to relative forest accessibilities. This means, in 
turn, that even when there is no detailed model, only the intensities (or probabilities) of pairwise interactions being
known, the relative forest accessibilities provide a comprehensible first approximation for the required values. 
Now we turn to derivative structural indices. The value $q_{ii}$
can serve to measure the solitariness of the ith member of a group. Now a number of other indices can be constructed
in the usual fashion. Specifically, the mean solitariness over a group,

$$p = \frac{1}{n} \sum_{i=1}^{n} q_{ii},$$

indicates the extent of its dissociation. The empirical variance of the solitariness evaluates the heterogeneity of a
group. The ratio of $q_{ii}$ to p (or their difference) measures the provinciality of the ith member of a group. Equation (4)
introduces a specific distance between the members of a group (Assertion 1 in Section 3). The properties of all these
indices are determined by those of the relative forest accessibilities studied above.
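The derivative indices can be computed directly from the diagonal of Q. The Python sketch below does this for the unit-weight path 0 - 1 - 2, assuming that the solitariness of member i is the diagonal entry q_ii, that the empirical variance uses the divisor n, and that provinciality is taken as the ratio variant; the function names are hypothetical.

```python
from fractions import Fraction

def forest_matrix(n, edges):
    """Q = (I + L)^{-1} for (i, j, weight) edges on vertices 0..n-1, computed exactly."""
    W = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for i, j, w in edges:
        w = Fraction(w)
        W[i][i] += w
        W[j][j] += w
        W[i][j] -= w
        W[j][i] -= w
    # Gauss-Jordan elimination on the augmented matrix [W | I].
    A = [row + [Fraction(int(r == c)) for c in range(n)] for r, row in enumerate(W)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)   # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [row[n:] for row in A]

def derivative_indices(Q):
    n = len(Q)
    solitariness = [Q[i][i] for i in range(n)]              # assumed to be q_ii
    dissociation = sum(solitariness) / n                    # mean solitariness
    heterogeneity = sum((s - dissociation) ** 2 for s in solitariness) / n
    provinciality = [s / dissociation for s in solitariness]   # ratio variant
    return solitariness, dissociation, heterogeneity, provinciality

Q = forest_matrix(3, [(0, 1, 1), (1, 2, 1)])
solitariness, dissociation, heterogeneity, provinciality = derivative_indices(Q)
print([str(s) for s in solitariness], dissociation, heterogeneity,
      [str(x) for x in provinciality])
```

In this example the two end vertices are more solitary (5/8) than the middle one (1/2), so their provinciality exceeds 1 while that of the middle vertex is below 1.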

Notice, in conclusion, that there exists a certain relation between the problem of centrality (respectively, 
provinciality) evaluation and the problem of estimating the strength of players from incomplete tournaments. In the 
latter case, an "object-object" matrix is processed as well, but its entries express the results of paired comparisons 
(e.g., games or comparative preferences) rather than personal choices within a group. The problem of scoring from
paired comparisons has been investigated somewhat more thoroughly (though still insufficiently). It is worth noting, for example,
that the work [20] was accepted as relevant in the literature on paired comparisons, though it was concerned with 
sociometric data. And conversely, sensitive scoring methods for preference aggregation can be considered with 
reference to sociometric data. A review of these methods can be found in [21]. 



APPENDIX 



Proof of Lemma 1. Lemma 1 is reducible to Lemma 2, since for every multigraph G, the corresponding
multidigraph $\Gamma$ can be introduced by replacing every edge of G with a pair of opposite arcs with the same weight
each. The matrix W (and thus Q) is the same for G and $\Gamma$, so the desired statements of Lemma 1 follow from the
existence of a natural one-to-one correspondence between all spanning rooted forests in G and all spanning diverging
forests in $\Gamma$.

Proposition 1 follows from the symmetry of W. 

Proof of Proposition 2. Item (1) follows from Theorem 3 and the positiveness of edge weights.
Item (2) immediately follows from the fact that $W = Q^{-1}$ satisfies the same condition [8, 6]. Another
easy proof is provided by Theorem 3 and the fact that for any $i_1, i_2, j \in V(G)$, $i_1 \ne i_2$, $\mathcal{F}^{i_1 j} \cap \mathcal{F}^{i_2 j} = \emptyset$ and

$$\bigcup_{j=1}^{n} \mathcal{F}^{ji} = \mathcal{F}.$$

Item (3) follows from item (2) and Proposition 1.

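Proof of Proposition 3. Note that for any $i, j = 1, \ldots, n$ such that $j \ne i$ and for any $H \in \mathcal{F}$, if $H \in \mathcal{F}^{ij}$
then $H \in \mathcal{F}^{ii}$. Therefore, $\mathcal{F}^{ij} \subseteq \mathcal{F}^{ii}$. Let $F_0$ be a subgraph of G such that $V(F_0) = V(G)$ and $E(F_0) = \emptyset$. Then
$F_0 \in \mathcal{F}^{ii} \setminus \mathcal{F}^{ij}$ and $\varepsilon(F_0) = 1$, i.e., $\mathcal{F}^{ij} \subset \mathcal{F}^{ii}$ and $\varepsilon(\mathcal{F}^{ij}) < \varepsilon(\mathcal{F}^{ii})$. By Theorem 3, $q_{ii} > q_{ij}$.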

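Proof of Proposition 4. If $i = j$ or $i = k$ then, obviously,

$$q_{ij} + q_{ik} - q_{jk} = q_{ii}.$$

Assume that $i \ne j$ and $i \ne k$. In the same way as in the proof of Proposition 3, we have
$\mathcal{F}^{ij} \cup \mathcal{F}^{ik} \subset \mathcal{F}^{ii}$ and hence

$$\varepsilon(\mathcal{F}^{ij} \cup \mathcal{F}^{ik}) = \varepsilon(\mathcal{F}^{ij}) + \varepsilon(\mathcal{F}^{ik}) - \varepsilon(\mathcal{F}^{ij} \cap \mathcal{F}^{ik}) < \varepsilon(\mathcal{F}^{ii}). \tag{5}$$

Define $\mathcal{F}^{ijk}$ as $\mathcal{F}^{ij} \cap \mathcal{F}^{ik}$. Observe that $\mathcal{F}^{ijk}$ differs from $\mathcal{F}^{jik} = \mathcal{F}^{ji} \cap \mathcal{F}^{jk}$ only by the roots in the trees
containing i, j, and k simultaneously. Therefore,

$$\varepsilon(\mathcal{F}^{ij} \cap \mathcal{F}^{ik}) = \varepsilon(\mathcal{F}^{ijk}) = \varepsilon(\mathcal{F}^{jik}) \le \varepsilon(\mathcal{F}^{jk}). \tag{6}$$

Inequalities (5) and (6) imply

$$\varepsilon(\mathcal{F}^{ij}) + \varepsilon(\mathcal{F}^{ik}) - \varepsilon(\mathcal{F}^{jk}) < \varepsilon(\mathcal{F}^{ii})$$

and, by Theorem 3,

$$q_{ij} + q_{ik} - q_{jk} < q_{ii}.$$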
Proposition 5 follows directly from Theorem 3. 

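Proof of Proposition 6. Item (1). Note that $H \in \mathcal{F}^{it}$ implies $H \in \mathcal{F}^{ik}$. On the other hand, $\mathcal{F}^{ik} \setminus \mathcal{F}^{it} \ne \emptyset$
and $\varepsilon(\mathcal{F}^{ik} \setminus \mathcal{F}^{it}) > 0$. Hence by Theorem 3, $q_{ik} > q_{it}$.

Item (2). By virtue of Eq. (3),

$$(I + L)Q = I. \tag{7}$$

Rewrite (7) componentwise for entries ik and it of the matrix $(I + L)Q$. Using Eqs. (1) and (2), the notation
$\varepsilon_{ij} = \sum_{p=1}^{a_{ij}} \varepsilon_{ij}^p$, and $i \ne t$, which follows from Proposition 3, we get

$$q_{ik} = \sum_{j \ne i} \varepsilon_{ij}(q_{jk} - q_{ik}),$$

$$q_{it} = \sum_{j \ne i} \varepsilon_{ij}(q_{jt} - q_{it}),$$

$$q_{ik} - q_{it} = \sum_{j \ne i} \varepsilon_{ij}\left[(q_{jk} - q_{jt}) - (q_{ik} - q_{it})\right].$$

Then, since $q_{ik} - q_{it} > 0$, there exists $j \ne i$ such that $\varepsilon_{ij} \ne 0$ (and thus $(ij) \in E(G)$) and $q_{jk} - q_{jt} > q_{ik} - q_{it}$
(recall that the case $\varepsilon_{ij} < 0$ is excluded).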

Applying this argument to vertex j instead of i, and so forth, and taking into account that no vertex in the
path thereby constituted may coincide with any previous one and that $i \ne k$, we finally obtain k as the terminal
vertex of this path, as desired.

Proof of Proposition 7. Let $\Delta W = W' - W$. Note that $\Delta W = XY$, where $X = (x_{i1})$, $i = 1, \ldots, n$, is
the column vector with entries $x_{k1} = 1$, $x_{t1} = -1$, and $x_{i1} = 0$ for all $i \ne k$, $i \ne t$; $Y = (y_{1j})$, $j = 1, \ldots, n$, is the row
vector with entries $y_{1k} = \Delta\varepsilon_{kt}$, $y_{1t} = -\Delta\varepsilon_{kt}$, and $y_{1j} = 0$ for all $j \ne k$, $j \ne t$. According to [22, Sec. 0.7.4],

$$Q' = (W + XY)^{-1} = Q - (1 + YQX)^{-1} QXYQ.$$

It is straightforward to verify that $-(1 + YQX)^{-1} = -h/\Delta\varepsilon_{kt}$ and $QXYQ = -\Delta\varepsilon_{kt} R$, and thereby item (1) is proved.
Items (2) through (5) follow from item (1) and the nonnegativity of $d_{kt}$ (see Proposition 3 or Assertion 1).

Proof of Proposition 8. Item (1). By Proposition 3, $q_{kk} > q_{kt}$ and $q_{tt} > q_{tk}$, and hence item (3) of
Proposition 7 implies $\Delta q_{kt} > 0$.

Item (2). Setting $Q' = Q(G')$, by item (1) of Proposition 7 we have

$$\Delta q_{it} - \Delta q_{ik} = h(q_{ik} - q_{it})(q_{tt} - q_{tk}) - h(q_{ik} - q_{it})(q_{kt} - q_{kk})$$

$$= h(q_{ik} - q_{it})(q_{kk} + q_{tt} - q_{tk} - q_{kt}) = h(q_{ik} - q_{it})\, d_{kt}.$$

Now the desired inequality follows from item (1) of Proposition 6 together with Assertion 1.

Item (3). By item (1) of Proposition 6, $q_{i_1 k} > q_{i_1 t}$ and $q_{i_2 k} > q_{i_2 t}$, and by item (3) of Proposition 7,
$\Delta q_{i_1 i_2} < 0$.

Item (4). By item (1) of Proposition 7 we have $\Delta q_{ij} = h(q_{ik} - q_{it})(q_{jt} - q_{jk}) = 0$.

Proof of Proposition 9. Consider the graph G' on the vertex set $V(G)$ such that

(1) $(ij) \in E(G')$ iff $i \ne j$ and $\varepsilon_{ij} \ne 0$, and

(2) for every edge $(ij) \in E(G')$, $\varepsilon'_{ij} = \varepsilon_{ij}$.

Let $Q' = Q(G') = (q'_{ij})$. Obviously, D is a macrovertex in G' as well as in G. Let $S = V(G) \setminus D$. First,
we prove Proposition 9 for G'. Consider the graph G'' resulting from G' by deleting all edges inside D. Let
$Q'' = Q(G'') = (q''_{ij})$. All vertices of D are symmetric in G''; therefore, $q''_{ik} = q''_{jk}$ for any $i, j \in D$, $k \in S$. Then using
item (4) of Proposition 8, by means of induction we get $q'_{ik} = q''_{ik} = q''_{jk} = q'_{jk}$ for all $i, j \in D$ and $k \in S$. This proves
Proposition 9, since $Q' = Q$.

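Proof of Proposition 10. Expand $Q = (I - (-L))^{-1}$ as the sum of an infinitely decreasing geometric
progression using the notation $M = (m_{ij}) = -L$:

$$Q = (I - M)^{-1} = I + M + M^2 + \cdots. \tag{8}$$

This expansion is valid if and only if

$$|\lambda|_{\max} < 1, \tag{9}$$

where $|\lambda|_{\max}$ is the spectral radius of $M = -L$ [22, Corollary 5.6.16].

Consider the upper bound of $|\lambda|_{\max}$ provided by the Gersgorin theorem (see [22]):

$$|\lambda|_{\max} \le \max_{1 \le i \le n} \sum_{j=1}^{n} |\ell_{ij}|. \tag{10}$$

Let $\varepsilon_{\max} = \max_{i,j,p} \varepsilon_{ij}^p$. Since $|\ell_{ij}| = \varepsilon_{ij}$ for $j \ne i$, where $\varepsilon_{ij} = \sum_{p=1}^{a_{ij}} \varepsilon_{ij}^p$, Eqs. (1) and (2) give

$$\max_{1 \le i \le n} \sum_{j=1}^{n} |\ell_{ij}| = 2 \max_{1 \le i \le n} \sum_{j \ne i} |\ell_{ij}| \le 2 \max_{1 \le i \le n} \sum_{j \ne i} a^* \varepsilon_{\max} = 2a^*(n - 1)\varepsilon_{\max}. \tag{11}$$

Consequently, the fulfillment of (9), and therefore of (8), is assured by

$$\varepsilon_{\max} < (2a^*(n - 1))^{-1}.$$

By virtue of (8), it suffices to prove that

$$m_{ij}^{(k)} = U_{ij}^{(k)} - P_{ij}^{(k)}, \quad i, j = 1, \ldots, n, \quad k = 0, 1, 2, \ldots, \tag{12}$$

where $m_{ij}^{(k)}$, $i, j = 1, \ldots, n$, are the entries of $M^k$.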

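Let us apply induction on the length k of the routes with drains between i and j. The proof can be used with
no change for the case of digraphs, because it does not use the symmetry of M.

1°. k = 0. Equation (12) is valid because $M^0 = I$ and by the definition of route with drains, for every
$i, j = 1, \ldots, n$, $j \ne i$, $U_{ii}^{(0)} = 1$ and $P_{ii}^{(0)} = P_{ij}^{(0)} = U_{ij}^{(0)} = 0$ hold.

2°. Let (12) be valid for k = v. Prove it for k = v + 1. Consider an arbitrary route $\mu$ of length v + 1
with g drains between vertices i and j. Let t be the next to last vertex of $\mu$. If $t \ne j$, then $\mu$ is representable as
the combination of a route of length v with g drains from i to t and an edge (tj). Otherwise, t = j, and $\mu$ can be
considered as the combination of a route between i and j with g - 1 drains and an edge incident with j (this edge is
the gth drain). Therefore, with $\varepsilon_{tj} = \sum_{p} \varepsilon_{tj}^p$ denoting the total weight of the edges between t and j,

$$U_{ij}^{(v+1)} = \sum_{t \ne j} U_{it}^{(v)} \varepsilon_{tj} + \sum_{t \ne j} P_{ij}^{(v)} \varepsilon_{jt}, \qquad P_{ij}^{(v+1)} = \sum_{t \ne j} P_{it}^{(v)} \varepsilon_{tj} + \sum_{t \ne j} U_{ij}^{(v)} \varepsilon_{jt}.$$

Then

$$\begin{aligned}
U_{ij}^{(v+1)} - P_{ij}^{(v+1)} &= \sum_{t \ne j} U_{it}^{(v)} \varepsilon_{tj} + \sum_{t \ne j} P_{ij}^{(v)} \varepsilon_{jt} - \sum_{t \ne j} P_{it}^{(v)} \varepsilon_{tj} - \sum_{t \ne j} U_{ij}^{(v)} \varepsilon_{jt} \\
&= \sum_{t \ne j} \left(U_{it}^{(v)} - P_{it}^{(v)}\right) \varepsilon_{tj} - \sum_{t \ne j} \left(U_{ij}^{(v)} - P_{ij}^{(v)}\right) \varepsilon_{jt} \overset{(1)}{=} \sum_{t \ne j} m_{it}^{(v)} \varepsilon_{tj} - m_{ij}^{(v)} \sum_{t \ne j} \varepsilon_{jt} \\
&\overset{(2)}{=} \sum_{t \ne j} m_{it}^{(v)} m_{tj} + m_{ij}^{(v)} m_{jj} = \sum_{t=1}^{n} m_{it}^{(v)} m_{tj} = m_{ij}^{(v+1)},
\end{aligned}$$

where transition (1) is carried out by the induction hypothesis, and (2) uses $m_{tj} = \varepsilon_{tj}$ for $t \ne j$ and the equality $m_{jj} = -\sum_{t \ne j} m_{jt}$, which
follows from Eq. (2) using $M = -L$. Proposition 10 is proved.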